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Coding and Decoding for the Dynamic Decode 
and Forward Relay Protocol 

K. Raj Kumar and Giuseppe Caire 
Abstract 

We study the Dynamic Decode and Forward (DDF) protocol for a single half-duplex relay, single- 
antenna channel with quasi-static fading. The DDF protocol is well-known and has been analyzed in 
terms of the Diversity-Multiplexing Tradeoff (DMT) in the infinite block length limit. We characterize 
the finite block length DMT and give new explicit code constructions. The finite block length analysis 
illuminates a few key aspects that have been neglected in the previous literature: 1) we show that one 
dominating cause of degradation with respect to the infinite block length regime is the event of decoding 
error at the relay; 2) we explicitly take into account the fact that the destination does not generally know a 
priori the relay decision time at which the relay switches from listening to transmit mode. Both the above 
problems can be tackled by a careful design of the decoding algorithm. In particular, we introduce a 
decision rejection criterion at the relay based on Forney's decision rule (a variant of the Neyman-Pearson 
rule), such that the relay triggers transmission only when its decision is reliable. Also, we show that a 
receiver based on the Generalized Likelihood Ratio Test rule that jointly decodes the relay decision time 
and the information message achieves the optimal DMT. Our results show that no cyclic redundancy 
check (CRC) for error detection or additional protocol overhead to communicate the decision time are 
needed for DDF. Finally, we investigate the use of minimum mean squared error generalized decision 
feedback equalizer (MMSE-GDFE) lattice decoding at both the relay and the destination, and show that 
it provides near optimal performance at moderate complexity. 
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I. Introduction 

Employing multiple antennas at the transmitter and the receiver of wireless communications is known 
to provide significant benefits in terms of both throughput (multiplexing gain) and reUability (diversity 
gain) (see [13] and references therein). When physical constraints limit the number of antennas that can 
be installed on a single wireless device (e.g., small sensors in sensor networks), the usage of cooperative 
wireless relay protocols is a promising alternative strategy. In these protocols, two or more terminals 
cooperate in order to mimic a super-user with multiple antennas. 

The relay channel was introduced by van der Meulen [2] and was studied in detail by Cover and 
El Gamal [3], who characterized the capacity for the discrete memory less as well as for the Gaussian 
degraded cases. The relay channel with fading was examined by Sendonaris et al., [4], where an achievable 
rate region was provided. In the case of slow fading, the outage behavior of half-duplex wireless relay 
chaimels was studied by Laneman et al., [5], and simple cooperative diversity protocols for signalling 
across these chaimels (such as amplify and forward and decode and forward) were introduced. In [15], 
Azarian et al. used the diversity-multiplexing tradeoff (DMT) formulation of [13] to study the outage 
behavior of slowly-fading relay chaimels in the high-SNR regime, and also introduced new classes of 
protocols such as the non-orthogonal amplify and forward (NAF) and the dynamic decode and forward 
(DDF). An improved DDF protocol based on code superposition was later proposed in [30]. The DDF 
protocol for the single relay case was subsequently studied in [16], where simplified variants of the 
protocol were introduced and some code design issues were addressed. Code design for the DDF protocol 
is also addressed in the recent contribution [29]. 

The present paper also focuses on the DDF protocol for the half-duplex, single relay single-antenna 
case. With respect to [16] and [29], we analyze explicitly the achievable DMT of practical codes with 
finite block length and propose a simple DMT optimal code construction that makes use of approximately 
universal codes for the parallel channel and of the Alamouti code. Approximately universal codes for 
the parallel channel may be obtained either from using a QAM base alphabet and a suitable unitary 
precoding matrix (lattice codes) or from permutation codes derived from universally decodable matrices 
(UDM) [21], [22]. We treat both cases and give construction examples and comparisons. Remarkably, 
our codes perform very close to the outage probability and have generally lower decoding complexity 
than those previously proposed. 

Furthermore, we discuss two often neglected issues: 1) the effect of decoding errors at the relay, and 
how to mitigate it; 2) the fact that the destination does not generally know a priori the relay decision 
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time. In order to tackle 1), we introduce a decision rejection criterion at the relay, such that the relay 
triggers transmission only when its decision is reliable. We show that the Forney's decision rule (a 
variant of Neyman-Pearson rule) yields almost optimal performance with practical finite length codes, 
while previously proposed options suffer from significant degradation. In order to tackle 2), we treat 
the channel "seen at destination" as a compound channel, where each compound member corresponds 
to a different relay decision time. We prove that a receiver based on the Generalized Likelihood Ratio 
Test (GLRT) rule, that jointly decodes the relay decision time and the information message, achieves the 
optimal DMT. We also show that a simpler scheme that performs separate detection of the relay decision 
time, by ignoring the structure of the coded signal and treating it as random, is generally suboptimal and 
it becomes optimal only in the limit of infinite block length. As an aside, our results show that no side 
information channel or additional protocol overhead is needed in order to inform the destination about 
the relay decision time. This may yield to much simplified actual protocol design for the DDF scheme, 
at the cost of an augmented decoder at the destination. 

With the lattice codes advocated in this paper, the decoder at the relay has to solve a closest lattice 
point problem with a rank deficient lattice matrix. It is well-known that standard sphere decoding [6], 
[7] yields exponential complexity in this case. In order to address this problem (again, often neglected 
in the current literature) we advocate the use of the minimum mean squared error generalized decision 
feedback equalizer (MMSE-GDFE) lattice decoder of [32], [34]. Via simulation of the performance of 
our explicitly constructed codes, we demonstrate that this lattice decoder is able to provide near optimal 
performance at moderate complexity. 

In Section In) we introduce the system model we work with and review relevant previous results. 



Section III presents the main result of the present paper, a characterization of the DMT of the DDF 
protocol for finite block length. Explicit code constructions that achieve this DMT are provided in 



Section IV and methods to enable error detection at the relay and low complexity decoding of these 
codes are also dealt with. 



II. Problem definition and background 

A. System model 

We consider the single relay channel shown in Fig. [1] where S, R and D denote the source, relay and 
destination, and h, gi and g2 denote the fading coefficients between the source -relay, source-destination 
and relay-destination terminals, respectively. 
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The channel fading coefficients are i.i.d. C3Sf(0, 1) random variables, corresponding to i.i.d. Rayleigh 
fading. Following the standard outage setting [5], [15], [13], we assume that the channel coherence time 
is considerably larger than the allowed decoding delay. Invoking a time-scale decomposition argument 
(see for example [18]) this setting is modeled by the so-called quasi-static fading channel, where the 
channel coefficients are random but remain constant over the whole duration of a codeword, although 
the latter can be very large. We consider slotted transmission where a source codeword spans M slots 
of length T symbols each, resulting in a total block length of MT. 

The relay operates in half-duplex mode. In decode and forward protocols, the block of length MT 
symbols is split into two phases. In the first phase the relay is in listening mode and receives the signal 
from the source. At a certain instant, referred to as the decision time in the following, the relay tries to 
decode the source information message. In the second phase, from the decision time to the end of the 
block, the relay switches to transmit mode and sends symbols to help the destination decode the source 
message. The DDF protocol is characterized by the fact that the decision time is not fixed a priori. On 
the contrary, the relay decides when to decode and switch to transmit mode depending on the channel 
coefficient h and the received signal. Therefore, the decision time is a random variable M. Without loss 
of generality, we restrict the decision time to coincide with the end of a sloQ i.e., M takes on values 
in the set {1,2,...,M}, where M = M corresponds to the case where the relay does not help the 
destination. During phase 1 (listening phase) the signal received by the relay is 

yr,k = hxs,k + Vk, A; = 1, 2, ... , MT, (1) 

'Notice that T is a design parameter. Letting T = 1 provides an unrestricted decision time. In thiis way, there is no loss of 
generality in this assumption. 
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and the signal received by the destination is 

Vk = giXs,k + Wk, A; = 1, 2, ... , MT. (2) 

During phase 2 (relay transmit phase), the signal received by the destination is 

Vk = 9ixs,k + 92Xr,k + Wk, k = MT+l,MT + 2,...,MT. (3) 

Here, = [xs,i ■ ■ ■ Xs,mtV denotes the source codeword, drawn from a code Xg C of rate R bits 
per symbol. Without loss of generality, we may assume that the symbols Xr,k transmitted by the relay 
are from an auxiliary code C C^"^ with rate R and block length MT, but only the last (M - M)T 
symbols of a codeword are effectively transmitted in phase 2, while in phase 1 the relay transmitter is 
idle because of the half-duplex constraint. 

The noise at the relay and destination, denoted by vj- ~ C3Nf(0, a^) and Wk ~ C3sr(0, cr^), form two 
white mutually independent sequences. We impose the same per-symbol average power constraint for 
both the source and the relay, given by 

E [\xs,k\^] , E < E, 

where E denotes the symbol energy, and define the SNRs of the S-D and the S-R links to be p = E/a"^ 
and p' = E/al, respectively. 

For later use, we introduce the following notation: let yj, y^j, x-^ ^ and x^^, each G C*^-' *^^, denote 
respectively the received signals at the destination and at the relay from symbol time + 1 to jT, the 
source transmit signal from time zT + 1 to jT and the relay transmit signal from time + 1 to jT, 
where the latter is assumed to be zero for all times k < MT. The quantities wj and are defined 
similarly. 

B. Diversity-Multiplexing Tradeoff 

A compact and convenient characterization of the tradeoff between rate and reliability of quasi-static 
fading channels in the high-SNR regime is provided by the DMT introduced in [13]. In this framework, 
rate and reliability are quantified in terms of the diversity gain d and spatial multiplexing gain r. A 
family of coding systems, each of which operates at SNR p with rate R{p) and error probability Pe{p), 
achieves a point (r, d) on the DMT plane if 

lim = r, lim — ^ — = —d. 

p— >oo log p p—^oo log p 

This latter relation is written as Pe{p) = p~'^ in the exponential equality notation of [13]. 



February 4, 2008 



DRAFT 



SUBMITTED TO IEEE TRANS. INFORM. THEORY, JAN. 2008 



6 



We will use the DMT as our performance metric when we analyze cooperative diversity protocols. It 
is clear that the DMT of the MIMO channel with one receive and two transmit antennas provides an 
upper bound to the performance of any relay protocol for the channel of Fig. [T] This bound, known as 
the transmit diversity bound [5], is given by 

dtx.div.bd.(r) = 2(1 - r). 

The DMT of the DDF protocol, proposed and analyzed in [15], is given by 

f 2(1 -r), < r < i 
[ (1 - r)/r, \<r <l 

This result is obtained by analyzing the information outage probability with Gaussian inputs, and it is 
achievable (e.g., by using a Gaussian random coding argument) in the limit of both M — > oo and T ^ oo. 
The relay decision time is given by 

M = min |m, ■ ..^^^12 M I > (5) 

i.e., M is set to the minimum m = 1, 2, . . . , M — 1 such that the mutual information between x^q and 
yj!}) for fixed and known h, given by mTlog(l + exceeds the number of information bits per 

message MTR. If such an m exists, the relay triggers the decoding of the whole information message 
and switches to the transmission mode. If no such m exists, then M = M and the relay remains silent. 
Both the limit of large M and T are necessary to achieve the DDF DMT in (|4]). In fact, the normalized 
decision time M/M must converge to a continuous random variable distributed in [0, 1] and, for every 
decision time M = m, the number of symbols rriT received by the relay must be arbitrarily large, 
such that the decoding error event coincides with the information outage error event. In this way, the 
corresponding probability of decoding error is arbitrarily close to the information outage probability 



2 M / MR\ 



P log(l + |/i|V) < 

m 



and the probability of undetected error (i.e., the relay accepts a wrong decision) is arbitrarily small. In 
brief, T — > oo is necessary in order to fix the optimal decision time based only on the channel strength 
and be sure (with arbitrarily high probability) that the decoded message is the correct one. 
We should also notice that, in the limit of T ^ oo, the outage probability does not depend on the 
knowledge of h at the relay decoder and of (51,(72) at the destination decoder (see for example [8]). 
On the other hand, a common assumption made in previous works is that the destination knows exactly 
the relay decision time M. In practice, this assumption requires some form of protocol to provide side 
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information to the destination. In the DMT analysis, one should pay great care to ensure that the error 
probability of such side information protocol does not dominate the decoding error probability, i.e., in 
designing any side information protocol we must ensure that its probability of error decreases not slower 
than 

Practical code design for the DDF protocol considers finite, possibly very short, M and T. In the 
following, we will make an explicit assumption of perfect receiver channel state information (CSIR), 
that is relatively easy to acquire using pilot symbols and is a common assumption in the DMT analysis 
of even finite-length codes (see [13] and [18]). On the contrary, we explicitly address the fact that the 
destination does not know a priori the relay decision time M and tackle this problem by analyzing an 
augmented decoder based on the GLRT rule. 

C. Existing DDF code designs 

In [16], a variant of the DDF protocol is proposed where the relay code is such that the signal 
received at the destination reduces to an Alamouti constellation [10]. We will refer to this scheme as the 
"Alamouti-DDF" scheme, and review it briefly in the sequel since we make use of the same approach. 
With the Alamouti-DDF, assuming that the relay decodes correctly at the decision time M = m, the 
signal transmitted by the relay at time k is given by [16] 

k = mT+l,mT + 3,... 

Xr,k = < , (6) 

k = mT + 2,mT + 4,... 

which reduces the signal seen by the destination for mT + 1 < k < MT to an Alamouti constellation. 
Through linear processing of the received signal y^'^ , the destination obtains the sufficient statistics for 
decoding, given by 

giXs,k + Wk, k = l,...,mT 

Vk = { , (7) 

+ \92\^Xs,k + Wk, k = mT+l,...,MT 

where the statistics of Wk are identical to those of Wk- In this case, it is easy to see that the mutual 

information per symbol at the destination, for M = m and i.i.d. Gaussian inputs, is given by 

— log (1 + V) + log (1 + + \92\^)P) (8) 

and coincides with that of the original DDF scheme defined by ([2]) and when the codebooks and X,. 
are also drawn independently from an i.i.d. Gaussian ensemble. Hence, the Alamouti-DDF modification 
entails no loss in DMT compared to the original DDF protocol [16]. 
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III. DMT OF THE DDF Protocol with finite length 

In this section, we characterize the achievable DMT of the DDF protocol with finite M and T. First, we 
find an upper bound on the DMT by letting T ^ oo, assuming that the destination has perfect knowledge 
of the relay decision time M, and using outage probability. Then, we shall analyze the performance of 
Gaussian random codes with finite length, with the assumption that the destination has no knowledge of 
M, and find a lower bound that matches the upper bound. 

Since for i.i.d. Gaussian inputs the Alamouti-DDF yields the same mutual information as DDF, as far 
as outage probability is concerned we can refer to the channel defined in ([7]). This is a set of parallel 
channels for m = 1, . . . , M, with dependent channel gains. In particular, there are two types of sub- 
channels: one representing the S-D link, and another set representing the composite (S,R)-D link (except 
for the case when m = M, which corresponds to when the relay remains inactive for the whole block; 
in this case, only the S-D link appears). The switching point between the two channels is controlled 
by the random variable M. We will refer to this channel as a random switch channel (RSC). Given a 
particular switching instant M = m, we will call the ensuing channel as a m-switch channel (m-SC). 
The RSC belongs to the class of "mixed channels" (see [9]), that is, a compound channel with an a 
priori probability distribution on the compound members. In this case, the probability distribution on the 
channel members (the m-SCs in dv])) is induced by the triple (M, 51,52)- 



A. Outage probability analysis 

We compute the DMT of the RSC defined above for arbitrarily large T under the assumption that the 
destination receiver has perfect knowledge of M, and hence find an upper bound on the DMT exponent 
d*^,j{r) for the finite-length DDF protocol. This is established by the following theorem. 

Theorem 1: The DMT of the single relay DDF scheme with decision times m = 1,2,...,M and 
finite slot length T > 1 is upper bounded by 

d*M{r) < dout{r) = min {dm{r) + dm{r)} , 

l<m<M ^ ^ 



where 

~d„,{r) 



Mr < r < 



m-1 ' " — ' — Af 

0, ^<r<f , (9) 
§<r<l 



. 2 - 2r, m< ^ 

dm{r) = { ' (10) 

m ' — 2 
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for r > ^, and 



dm{r) 



2r, 

tM 



m < 



M 



M-m ' 
M(l-r) 



M_ 
2 



<m< M(l - r) 
m > M(l - r) 



(11) 



for r < 2- 



Proof: Let M denote the random decision time as defined in (jsjl and Pout{f) denote the outage 
probabiUty of the corrsponding RSC. Also, let P^t^'"{r) denote the outage probability of the m-SC for 



given m. Then, the law of total probability yields 



M 



(12) 



m=l 



Since in the regime of very high SNR that characterizes the DMT, scaling SNR by a constant does 
not change the DMT, we allow both p, /?' — > oo and the DMT shall not depend on the (constant) ratio 

P'/P = (^1/(^1 Define 



Pout{r) 



P(M = m) 



p-d^{r)^ 1 < m < M, 
p-d^{r)^ 1 < m < M. 



Then, it is clear from ( 12i that 



mm 



{dm{r) + dmir)} . 



l<m<M 

Furthermore, from standard arguments based on Fano inequality [13] and because here we are assuming 
that the destination receiver is enhanced by the side information on M, it is also immediate to conclude 
that dlj{r) < dout{r). 

It remains to prove (jojl and ( [TO] ), ( [TT] ). Notice that dm{r) is solely a function of the S-R link and dm{r) 
is a function of the R-D and S-D links. We analyze these quantities separately as follows. 

1) Analysis of p^*^™ ('').- Let's consider first the case m < M. Set R = rlogp. The probability that 
the relay decodes after m sub-blocks P(M = m), 1 < m < M — 1, corresponds to the event 

{mriog(l + \hfp') > MKT > (m - l)riog(l + \h\'^p')} 
^ {^logp < log(l + \h\'p') < ^logp} 



44> 



■-1 



<\h\'< 



p' 



(13) 
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Since is exponentially distributed and p' = p, we compute 



Mr _^ 

P(M = m) = / e'^dz 



According to the value of the multiplexing gain, we analyze the above quantity for each 1 < m < Af as 
follows. 



r > 



A/- 



This corresponds to — — 1, — 1 > 0. In this casq^ 



m—l 



P{M = m)= p- 



m—l 
M 



< r < 



M- 



This corresponds to ^ — 1 < 0, — 1 > 0. In this case, 



P(M = m) = p° 



r < 



m—l . 
M ■ 



This corresponds to ^ — 1 < 0, ^^j^ — 1 < 0. In this case, using a power series expansion. 



Mr 



P(M = m) 



Mr- 1 P 



2(^-1) 



1 p 

1-/9 — 1 ^ + - 



2! 



+ 



2! 



+ • 



A similar analysis for P(M = M) results in 



P{M = M} = 



Therefore, the result for all 1 < m < M can be compactly expressed by ([9]), shown in Fig. [2] 
2J Analysis of dm{r): From ([8]l, the outage probability of the m-SC is given by 

P^t^^{r) = P (J™-^^ < MTR) , 



^The notation P ^ p °° indicates tiiat P decreases faster tiian any polynomial function of p. 
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dm{r) 




m— 1 m 
M M 



1 r 



Fig. 2. Negative p-exponent of the probability of tiie relay decoding after exactly m-subblocks. 



where a™"^*^ = mriog(l + \gi\'^p) + (M - m)Tlog[l + (|c/ip + |52p)/o]- Defining |c/ip = p'"^ and 
1(72^ = /0~"^ and applying standard approximations in the regime of large p, we eventually obtain 

^o"r'^(0 = P ((M - m) max{[l - ai] + , [1 - a2]+} + m[l - ai]+ <rM), 

where = max{0,x}. Since \gi\'^ and |(72p are independent exponential random variables, the joint 
pdf of (a 1,02) is given by 

f{ai,a2)=e-P''"~P"''p'^^-''\ 



Therefore, 



P^ut'^^ir) = [ p-^^-^^da, da,, 



where 23 is the two-dimensional region defined by the inequalities {M — m) max{[l — ai] + , [1 — 02]+} + 
m[l — q;i]+ < rM and > V i. 

Using Varadhan's lemma [17], we obtain 

dm{r) = inf {ai + 02} • (14) 

Define /? = The region B is equivalently defined by 

(1 - 13) max{[l - ai]+, [1 - a2]+} + - ai]+ < r, 
a,- > V i. 



It is obvious that we may restrict attention to Oj < 1 V i insofar as computing the infimum in ( 14 1 is 
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C(2 




Fig. 3. The region B. 



concerned. We analyze "B according to the following cases: 

• ai > a2'. 
We have 

/5(l-ai) + (l-/5)(l-a2) <r 
^ I3ai + (1 - /3)a2 > 1 - r. 

This line has intercepts and on the ai and 02 axes respectively. 

• tti < 02: 
We have 

/3(l-ai) + (l-/5)(l-ai) <r 
<^ ai > 1 - r. 

The region S is depicted in Fig. [3] The solution to the problem in Q corresponds to choosing the least 
non-negative k such that the line ai + a2 = k touches H. The analysis should be done according to 
whether ^2 — 2r44>/3^2 and whether 2 — 2r ^ 1 44> r ^ 2- It is immediate from Fig. jsjthat 
the solution to ([14]) when r > ^ is at {a\,al) = (^,o] for /? > 0.5, and {a\,al) = (1 - r, 1 - r) 



for [} < 0.5. For the case when r < ^, the solution to ( 14l is at (a^, Og) = (1 — r, 1 — r) for /3 < 0.5, 
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at {al,a*2) = - j^) for (3 > 0.5 and ^ > 1, and at (a^,a^) = (V'^j ^ - 
< 1. The final solution is compactly expressed by ( [TO] ), ( [TT] ). 
This concludes the proof of Theorem [T] ■ 

B. Achievability 

We consider finite length T and no a priori knowledge of M at the destination decoder. We have the 
following result: 

Theorem 2: The upper bound of Theorem [T] is achievable. Therefore, d\.j{r) = dout{f)- 

Proof: We consider the original DDF protocol (not the Alamouti variant) defined by ([T]), (|2]l and 
([3]). For this channel we construct a particular coding scheme and analyze its performance. 

Codebook generation: For given M, T and R, we generate C C and C C of cardinality 
prMT independently, with i.i.d. components ~ C]M"(0, E). We let Xs(a;) and Xr(w) denote the codewords 
in Xs and in Xr, respectively, corresponding to the information message u) G {1, • • • ,p^'^^^}. 

Relay decoding: We define the relay outage event at slot m as 

Om=|/^eC:|/i|^< ^'"^7^ | (15) 

Differently from the case of arbitrarily large T, the relay may decode in error at time m even though 
h ^ Om- In the presence of such undetected error the relay would switch to transmit mode and send a 
codeword corresponding to an incorrect information message, thus jamming the destination receiver. In 
order to avoid this event we consider a bounded distance relay decoding decision function ips defined as 
follows (see [12]): for m = 1, . . . , M — 1, define the regions %m{<^) of all points y G C'^'^ for which 
uj is the unique message that is contained in a sphere of squared radius mT{l + 5)a1 centered at y, 
i.e., |y - /ix™o(cj)|2 < mT{l + 5)al. Then, let iJsiy'^Q, /i) = D G {1, . . . , /*^^} if both the following 
conditions are satisfied: 

1) hiOrn, 

2) y-o G §„(D); 

(the relay has perfect knowledge of its own channel coefficient h, by the perfect CSIR assumption). If 
these conditions are satisfied, then M = m and the relay switches to transmit mode, sending the signal 
xj^(D) for the remaining part of the block. Otherwise, it refrains from making a decision and waits for 
the next slot. 
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It should be noticed that the condition 2) above is a test on the typicaUty of the estimated channel 
noise. In fact, if lo is the transmitted message, we have that 

|y-o-/ix-MP = |vo"^P 

is a central chi-squared random variable with 2mT degrees of freedom and mean mTa'^, that provides 
an empirical estimate of the noise variance. 

Destination decoding: The destination is not aware of the relay decision time M. Hence, it makes 
use of an augmented decoder that simultaneously detects the decision time and the information message 
according to the GLRT rule: 

{u),m} = argmaxp (yo^|u;, m, 51, 9^2) • (16) 

where p{yQ^\uj,m, gi, §2) is the decoder likelihood function, i.e., the pdf of the signal received by the 
destination over the whole block length, under the hypothesis that the source transmitted the information 
message lu, that the relay decision time is m, and given the channel coefficients 91,52 (recall that we 
assume perfect CSIR). 

Error probability analysis: Let £ denote the decoding error event at the destination and £r denote the 
decoding error event at the relayj^We can write 

M 



P(£) = ^P(M = m)P(£|M = m) 

m=l 
M 

= P(M = m) (P(£, £,.|M = m) + P(£, fi^jM = m)) 

171=1 

M 

< P(M = m) (P(£^|M = m) + P(£|£r, M = m)P(Er\M = m)) 



M 



< ^P(M = m)(P(£^|M = m)+P(£|£r,M = m)). (17) 

First, we bound the effect of the undetected decision error at the relay. Our analysis follows closely the 
analysis of the MIMO-ARQ scheme in [12]. In fact, the relay applies a scheme very similar to ARQ: 
when it is sure about its decision it stops receiving and starts transmitting, while if it is not sure about 

^The complement of an event A is denoted by A. 
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its decision it waits for the next slot. We have 



< P{W^\'>mT{l + 5)al) 

< (1 + <5)'^^e-""^^ 



(18) 



where the last line follows from the Chemoff bound on the tail of the chi-squared distribution. Letting 

5 = ji log p, we find 

P(£^|M = m) <p-""^^. 

Notice that P(£|£r,M = m) > where dm{r) is the exponent of the information outage 

probability of the m-SC channel given in ( [T0| ), ( [TT] ) and is not larger than 2. Hence, it is sufficient 



to choose i^iT > 2 in order to make the terms P(£r|M = m) exponentially irrelevant in (17) 



Next, let us examine the probabilities P(M = m). Let Um = U^=i ^mi'^) denote the subset of the 
relay channel output space C™^ such that if y™Q G Um then there exists a unique codeword within the 
bounded distance decoder's decoding sphere centered at y™Q. For m = 1, we have 

P(M = 1) = P({/i^ Oi},{yJ,oeUi}) 

= n-Mr) 



(19) 



For brevity we let Dm = {h ^ O-m} n {y,^o £ Um}. Then, for 1 < m < M, we have 

P{M = m) = P{Vi,...,Vm^i,Vm) 
< P{Vm-l,Dm)- 



(20) 



For 1 < m < M, from ( 20 1 we can write 



P(M = m) < p({{/ieOm_i}U{y:;},-i ^U„_i}},{/i^O„_i},{y™o^lt 

< P({/iGOm-i},{/i^Om}) + P({/i^Om-i},{y™o"'^^m-i}), (21) 

where the second inequality follows from the fact that for events A,B,C and D, we have using the 
distributive law and the union bound that 

P{{AuB}n{CnD}) = P{{Au{BnA)}n{CnD}) 

< P{AnC) + P{BnA). 
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Finally, for m = M, we have 

P(M = M) = P (©!,..., ^M-l) 

< P{Vm-i) 

= P({/l^OAf-l},{y^?o~'^UM-l})+P(/jGOA/_i). (22) 



We notice that the event {h G Om-i} n {ft, ^ Om} coincides with (13 1 and therefore the first term in 
(21 1 decreases as p^'^^(^\ It is also immediate to see that P {h ^ O^j^i) = Hence, we are 

left with the analysis of the probability 

P({/i^O„},{y,"},^U„}) (23) 

for all m = 1, . . . , M — 1. Averaging with respect to the random coding ensemble, we may choose 
without loss of generality = 1 as the reference transmitted message. We have 

Um Q {|vS^|2 > mT{l + 6)al} U 

where Olm{l) are the points y^P-Q such that |y™Q — /ix^q(1)P < mT(l + 6)a^, and there exists some 
w / 1 for which also {y'^^Q - /ix™o(a;)|2 < mT{l + S)al Letting for brevity Ax(tj) = x™o(a;) - x^o(l), 
we can write 

^m{l) = U {l^" - < mTH + 5)al \wZf < mT{l + 6)al} . 

Using the union bound and the Chemoff bound we have 

P({ft^O™},{y;!l,^U„}) < P(K|2>mr(l + 5)a2)+P({/i^(D^},3l™(l)) 

{|v^ - /iAx(a;)|2 < mT{l + 5)alY) (24) 



Let us consider one term in the sum in the last line of ( 24 1 for a given message uj and given channel h, 
averaged over the random coding ensemble. Noticing that for vectors a and b and T > we have 

{|a + b|2 < r, |b|2 < r} C {|a|2 < 4r}, 
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we can bound this probability as 

P(^||vS^-/iAx(w)P < mT{l + S)a^,Y < mT{l + 5)a^^\h^ (25) 

< Pf |/iAx(w)|2 < 4mT{l + 6)al 



'-^ P{p'\h\'^X<'^rnT{l + 6)\h) 

<p-mT[i~,yU^ (26) 

where (a) follows from the fact that for the randomly generated codewords, x = l^^i^)]"^ / ^ is a central 
chi-squared random variable with mean 2mT and 2mT degrees of freedom, and the last line follows by 
letting = p~'^ and from the fact that the chi-squared cdf satisfies P{x < u) = 0{u'^^) for small u 
and P{x ^ u) = 0(1) for large u. Summing over the p^^^"^ — 1 messages w / 1 and integrating with 
respect to the pdf of over the set 0^, we obtain 

P {{h i 0™}, {y^O i Itm}) < / p-mnX-v\,^rMT 

= p~~^-^'\ {21) 
where, from a standard application of Varadhan's lemma, we have 

dm{r)= inf {v + mT[l - u]+ - r MT] . (28) 

The domain of v over which the infimum is calculated is non-empty only for r < j^. This means that 



the set of channels for which the probability in (23 1 has a polynomial decrease is empty for r > ^ 
and therefore dm{r) = cxd for r > jj. For r < ^ it is not hard to see that for all T > 1 we have 
dm{f) = 1 — Comparing dm{r) with dm{r) we see that the former dominates the latter for all 
r S [0,1]. It follows that for our relay bounded distance decoder and the Gaussian random coding 
ensemble P(M = m) < 



So far we have shown that in the upper bound (17 1 the terms P(£r|M = m) are asymptotically 
negligible and the terms P{JA = m) are upper bounded by the same exponent of the outage probability 
based, infinite T, case. It remains to show that the terms P{E,\E,r,^ = m) have exponent dm{r) given 



in ( [T0| ), ( [TT| ), and the proof will be complete. 

We consider the GLRT decoder at the destination. This decoder ignores the knowledge of the a priori 
distribution of M and treats it as a deterministic unknown parameter. Hence, we are in the presence of a 
compound channel formed by the family of m-SC component channels, without any a priori knowledge 
of M. 
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Again, without loss of generality we assume message 1 is transmitted. While for the sake of notational 
simplicity, we omit the explicit conditioning with respect to it is understood that the relay has perfect 
knowledge of the transmitted information message. We omit also the explicit conditioning with respect 
to CSIR and denote simply by y since no ambiguity is possible at this point. Hence, the likelihood 
function p(yQ^\uj,m,gi,g2) shall be denoted simply by p(y|a;,m). The pairwise error probability for 
some Lo ^ 1 can be upper bounded as follows: 



P(l ^ a;|M = m) = P ( maxp(y|l, m') < maxp(y| 



Lo, m 



M. = m] 



< P ( p(y|l, m) < maxpfylo;, m') M = m) 

\ m' / 



M 



P\ IJ {p(y|l,"^) < p(y|a;,m')} 



.m'=l 



M = m 



M 



< P (p(y|l,m) < p{y\uj,m')\M = m) . 



(29) 



m'=l 



We shall analyze separately the terms inside the above sum, averaged over the random coding ensemble. 
Define the event 

^ I p(y|i,"i) 

We first analyze the probability of the event £i, which we then use to compute P(£). Assuming M = m, 
the actual received signal is 



m 

yo 



M 
J m 



5ix-o(l) + Wo"^ 



(30) 



We consider the case m! > m and leave the case m' < m to the reader, since it follows in an almost 
identical manner. Define the partial codeword differences Ax^^q = xy^Q(l) — xy^Q(a;), Ax^^^ = xy^^(l) — 
^tm{^\ Axf^, = x^^,(l) - xf^,(a;), and Ax,^^, = x,^^,(l) - The error event £i can be 

written as 

£i = { |5iHAx-o|' + 2Re{5i(wS^)HAx-o} + |5iAx-;, + 52x-^(l)|' + 
+2Re {(w™')H [5iAx™;„ + <72x;!:„(l)] } + |<7iAxf„, + ffaAx,^^,]' + 
+2Re {(w^,)^ [5iAxf„, + 52Ax,^^,] } < } . (31) 
After a little algebra, we obtain the compact expression 

£i = |2Re{z^^vir} < -|zp| , 
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where z is defined as 

5iAx^o 

For given codebooks Xg, X^, the variance of 2Re{z'^w} is equal to 2|zpc7^, which leads to 



P(£i|Xs,Xr,3V[ = m,gi,g2) < Q 
Define the following notation, 



= [Xs,i{l) Xs,i{uo) Xr,i{l) Xr,i{uj)\^, 1 < i < MT, 



and 



t A rtTAT ifT iT ^ p4Mrxl 
? — [Cl?2 ■ ■ '^MtJ S 

It can be verified that |zp = ^'^M^, for a block diagonal M of the form 

Ml 



M = 



where for 1 < k < mT, 



Mfc = l^il 



1 


-1 








-1 


1 

































for mT + 1 < A; < m'T, 



and for m'T +l<k< MT, 



Mi 



blp 


-^\9i? 


929*1 





-\9i\' 


\9i? 


-929*1 





9i9l 


-9i9l 


\92? 

















\9i? - 


-\9i? 


929* 


-929* 


-\9i? 


\9i? - 


-929*1 


929*1 


9i9l ■ 


-919*2 


\92\' - 


-\92? 


-9i9l 


9i9l - 


-\92\' 


\92? 
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It turns out that the matrices have rank 1, for all 1 < A; < MT. It follows that the eigenvalues of each 
M/; are tr(Mfc), 0, 0, 0. We now average P(£i|Xs, X^, m,, (71, (72) over the ensemble of random Gaussian 
codebooks. In order to do so, we use the following well-known result on the characteristic function of 
Hermitian quadratic form of complex Gaussian random variables (briefly, HQF-GRV). 

Lemma 3: [19, Appendix 4] The characteristic function of the HQF-GRV A = z^Fz, where z ~ 
e>J"(z, R) is given by 

^ ^^ irrr r am exp{-sz^F{I + sKF)-^z) 
cI>M.) = E [exp(-.A)] = . 

Therefore, 



= $ 



e-NIV(4^?) 



_ 1 

det(I + f M) ' 

Explicitly, we have 

1 1 1 



det(I + fM) [l + l\g,\^r^ [l + f(2biP + b2P)]("^'-'")^ [l + + M')]^^'-""'^^ 

= I I (32) 

[l + p\9i\T^ [l + p{\9i\' + \92\^)r-^^' 



We notice that (32i does not depend on m', at least in the exponential equality sense. Summing over all 
m' = 1, . . . , M and over all messages w / 1, we eventually can bound the average probability of error 
of the GLRT decoder conditioned on M = m and on 171 , g2 as 

P(£|£„M = m,5i,52) < ^c^|M = m, (71,52) 

M 

< X] X] -P(Sl|"^,5'l,52) 

uj=/=l m'=l 

< 'V'^ ^ (33) 

- [i + PkPl-^ [i + p(kP + l»P)l<"-""^ 

Next, we shall evaluate the diversity exponent of P(£|£r,M = m) = Egj^gjP(£|£r, M = m,gi,g2)]. In 
order to do so, we separate the outage event from the no-outage event. Define the outage event of the 
m-SC as 

Am = {(M - m) max{[l -Qi] + ,[1 -02]+} + -ai]+ -rM < 0}. (34) 
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Then, 

P(£|£r,M = m) = P{8,,Am\Er,M = m)+P{8,,Am\lr,'M = m) 

(35) 

Recall that 



where dm{r) is evaluated in ( [T0| ), ( [TT] ). In order to evaluate P{E,Am\&r,^ = "i), we use (33 1 and write 
the exponential inequality 

P(£|£„M = m,5i,52) < 

where 

gm(ai,a2,?') = (M - m) max{[l - ai]+, [1 - 02]+} + m[l - ai]+ - rM. (36) 
Therefore, using again Varadhan's lemma, we obtain 

P(£,X„^|£^,M = m) = yg-^^.-W, 

where 

dG,m{r) = inf {ai + 02 + T^mCai, 02, . (37) 

5m(ai,Q2,r) > 

ttl) CK2 ^ 

The above infimum is achieved when 5m(ai,a2,r) | 0, yielding 

dG,m{r) = dm{r). 

This concludes the proof of Theorem |2] ■ 
Remark. The proof of Theorem [2] is not only conceptually appealing, but also reveals a few very 
important and often neglected features that should be taken into account in the design of a DDF scheme. 
First, the proof sheds light on the fact that the relay must make its decision based not only on the outage 
condition, but also on the reliability of the decoding decision. Then, it shows also that despite the fact 
that the destination does not know the relay decision time, there is no need for an explicit protocol that 
provides this side information. In Appendix |l| we analyze a simpler decoder, nicknamed relay activity 
detector, based on separated detection of the relay decision time by treating the codewords as random 
Gaussian signals (i.e., ignoring the structure of the code). We show that such a simple "energy detector" 
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is optimal if we let T ^ oo first, and then consider the high SNR performance, but it is dramatically 
suboptimal if we do the limits in the reverse order. In fact, for any finite T, the relay activity detector 
yields a constant error probability, that does not vanish with SNR. 

C. Computing the DMT and comparisons 

Obtaining a closed-form solution to the DMT expression in Theorem [T] appears to be intractable. We 
plot in Fig. |4] values of d\j{r) for M = 2,5, 10 and 20 in comparison with the optimal DMT of the 
DDF protocol (corresponding to M = oo). 
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Fig. 4. The DMT of the DDF channel with finitely many decoding decision times. 



With increasing M, d\j{r) is seen to approach the optimal tradeoff very rapidly. For practical code 
design, even a relatively small value of M is therefore expected to have close to optimal performance in 
terms of diversity. 
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Remark. The authors in [16] consider a related problem, where T ^ oo and the relay is restricted to a 
finite number of decision times (say N). These time instants coincide with the end of blocks {Mj}^^^, 
with 1 < Ml < • • • < Mn < M \/ j (notice: with this notation, in our case we would have N = M and 
Mj = j). Further, define Mq = 0, Mn+i = M, and a set of "waiting fractions" by fj = ^■ 

Thus 

/O = < /i < • • • < /tv < fN+l = 1. 

In [16], it is proved that for any fixed N no set of waiting fractions yields a DMT curve that dominates 
all others. Then, a particular set of waiting fractions are chosen that yield for any fixed N a DMT curve 
that is not uniformly dominated |^ by any other protocol with the same number of decision times N. The 
resulting DMT is derived and it is summarized by the following lemma from [16]. 

Lemma 4: [16] For the DDF protocol with a given number of decision times, let /f = | and 

/f = i—^s, ■ for K j < JV, 

then no set of fractions uniformly dominates {/Jj^i. Further, the DMT con^esponding to the set of 
fractions is given by 

r 

(38) 



N 



+ 



dP{r) = 1 - r + 

A few interesting observations can be made about this result. As it is remarked in [16], the DMT obtained 
through {fj} is not asymptotically optimal, i.e., it does not converge to the optimal DMT of the DDF 



protocol as ^ oo. Indeed, it is evident from (38 1 that dp{r) consists of two straight line segments, 
say XLi for < r < and £^2 for < r < 1. As ^ cxd, £1 and £2 can at best be tangential 
to the curved part of the DMT of the DDF protocol (i.e., the 0.5 < r < 1 region) in Q. In particular, 
the optimal value of the DMT of the DDF protocol d*{0.5) = 1 is never approached even in the limit 
by (F{r). In contrast, the DMT d\,^{r) derived in this paper is asymptotically optimal. As the number 
of decoding points increases, d\i{r) dominates over #(r) for almost all values of r, and is strictly less 
for only an exceedingly small range of values of r. Asymptotically, it is clear that the only set of points 
where ^^(r) dominates d\.^{r) is a very small set of points around the point where #(r) is tangent to 
the curved part of the DMT of the DDF protocol. 

''According to the definition in [16], protocol A uniformly dominates protocol B if, for any multiplexing gain r, dA{r) > dsir). 
A protocol that is not uniformly dominated by any other protocol is said to be Pareto-optimal. 
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IV. DMT OPTIMAL CODES FOR THE SINGLE RELAY DDF CHANNEL 

The authors in [15] used the ensemble of random Gaussian codes of asymptotically large block- lengths 
to show the achievability of the DMT of the DDF protocol. Subsequently, a construction of codes derived 
from cyclic division algebras (CDA) was shown to achieve the DMT of the DDF channel for arbitrary 
number of relays [28]; i.e., they achieve the corresponding tradeoff for a particular number of decoding 
instants. As we increase the block-length and the number of decoding instants, the DMT of these codes 
tends towards the optimal DMT of the DDF protocol given in Q. In a recent submission, [29], the 
authors present a division-algebraic construction based on the Alamouti code that is similar in flavor 
to the construction to be presented in this paper. However, for the codes in [29], the parameter T is 
fixed to 2; on the contrary, we will see that our code construction is valid for arbitrary values of T 
including the special case of T = 1, and is hence a minimum delay construction. Decoding these codes 
involves sphere or sequential decoding [31], [32] over a large dimensional lattice. It is hence of interest 
to construct codes that achieve the DMT of the DDF protocol and permit low complexity decoding. Since 
our construction is of minimum delay, the dimensionality of the lattice to be sphere decoded is half that 
of the corresponding case in [29]. 

In order to completely specify a signalling scheme (Xs, (/;, X^) for the DDF channel, we need to define 
the following: 

1) A code Xs that is used by the source. 

2) A causal decoding decision function (/){■, •) : (C,C^^^) {1,2, . . . ,M}, that dictates when the 
relay attempts to decode the source's transmission based on the S-R channel gain h and the signal 
Yr received at the relay. In particular, if 4'{h, y^) = M, the relay will not attempt to decode the 
transmission of the source. If cl){h,yr) = m, 1 < m < M, then the relay attempts to decode the 
transmission of the source upon completion of the block. Because of the causality constraint, 
we assume that the output of (p at time m depends only on h (CSIR of the relay) and on yj"g. 

3) A code Xr used by the relay. In the following, we will only consider the case that the relay 
implements the Alamouti-DDF scheme [16] given in ([6]); hence X, is the same as X^ upto coordinate 
permutations, sign change and conjugation. 

A. Design tradeoffs 

Despite the importance of the decoder at the destination, as evidenced in the proof of Theorem 2, in 
this section we take a shortcut and we do not treat the the GLRT decoder explicitly. For the sake of 
simplicity, our simulations assume a genie-aided destination, ideally informed of the relay decision time. 
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The main focus of this section is on the design of the codebook and on the efficient implementation 
of the relay decoding decision function, in order to trigger the relay transmission only when decisions 
are reliable. 

Choosing a good relay decoding decision decision function (j) is critical to ensure good performance: a 
conservative (j) that makes the relay wait for too long before decoding results in low relay error probability 
P(£r), but increases the destination error probability P(£, £r ) since the relay has less time to help the 
destination. Vice-versa, a that is too aggressive and makes the relay decode too early yields low 
P(£,£r.) but results in a large P(£r), since the blocklength of the signal observed at the relay is too 
short to cope with atypical noise. We shall also see through simulations that undetected decoding errors 
at the relay have a huge impact on performance, since the relay ends up jamming the destination with 
high probability. We will present our choices of Xg and (j) in the following two subsections. 

B. Approximately universal Xs 

The equivalent channel resulting from the use of the Alamouti scheme for the relay code is a parallel 
channel Q with statistically dependent fading coefficients. We will choose Xs to be a code of length MT 
that is approximately universal over the parallel fading channel. A code that is approximately universal 
over the parallel channel (a notion introduced in [21]) meets the DMT over any parallel channel. Such 
a code has an error probability that decays exponentially with p for all parallel channel gains such that 
the corresponding mutual information is larger than the coding rate, i.e., for all channel gains in the 
no-outage region. Therefore, such an approximately universal code X^ meets the DMT of the relay DDF 
channel for any M. This means that, for any fixed rate R and sufficiently large SNR, the decay of error 
probability with SNR of our code (with finite T) exhibits the same slope of outage probability. However, 
the "gap from outage" (i.e., the horizontal distance in dB between the outage probability and the actual 
probability of error) is not captured by the DMT optimality and in practice it may be very large, thus 
making a DMT-optimal scheme totally useless for practical purposes. We shall discuss ways to close this 
gap in the next section, by an appropriate choice of the relay decision function (f). 

We may obtain approximately universal Xs from either suitable algebraic lattices [23], [24] or from 
permutation codes through UDMs [21], [22]. In the following we briefly review these constructions. 

1) Rotated QAM codes from algebraic lattices: Let L be an AfT-dimensional extension of Q(z) and 
let the Galois group Ga/(L|Q(z)) = {(Ti, . . . ,(Tmt}- Denote the ring of integers of L as Ojl and let J 
be an ideal of Ol- Let A'^l|Qi(j)(") denote the algebraic norm from L to Q(2). We define the code X^ as 
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follows: 



G 8 > , 



where 8 is some finite subset of J. X^ has the desirable property of a "non-vanishing" product distance, 
since we have for each x G X. that 



MT 

n 



MT 



since the norm A'l|(Q(j)(-) of an algebraic integer in L is an element of This non-vanishing product 
distance property ensures that X^ is approximately universal over the parallel channel [21], [29]. 
It can be verified that X^ can equivalently be rewritten as a lattice code, i.e., 

X, = {Gb|bGS}, (39) 

for suitable G G £MTxMT .3 ^ Z[z]^^^. A particular choice of G and "B that is good in terms 
of shaping consists of constructing G to be unitary and S to be a set of points in contained in 

a hypercube that is centered around the origin |^ For the algebraic details regarding the construction of 
such unitary G, see [23], [24]. Notice also that choosing the information set S to be a bounded subset 
of Z[i]^''^^ corresponds, in practice, to choosing information symbols from a QAM alphabet, which is 
appealing for practical implementation. The rate of X^ in this case is 

R = , bpcu. 



Parameters for simulations: In the simulations to follow in Sec. IV-C we construct the matrix G using 
the cyclotomic construction given in [23]. For M = 4 and T = 1, G is a complex 4x4 matrix, or 
equivalently a real 8x8 matrix. We choose 23 to be a cartesian product of Q^-QAM alphabets, 

T, = {a + tb\ - Q + 1 < a,b < Q - 1, a,b odd}^^^ , 

for some even integer Q. Thus 1231= Q^^^^. For example, by choosing Q = 4 with M = 4 and T = 1 
we obtain a rate of i? = 4 bpcu. 

""Notice however that choosing G unitary is optimal only when we are constrained to use a linear map to encode the 
information vector onto the code symbols. An alternate approach is to use a constellation carved out of a dense lattice in R" 
and employ a non-linear sphere encoder and a mod-A MMSE-GDFE lattice decoder; this has been shown to yield significant 
performance improvements over unitary shaping [26], [27]. For simplicity of exposition, we will restrict our attention to the 
case of linear encoding in this paper. 
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2 ) Permutation codes from UDM: Approximately universal code construction from UDM were intro- 
duced in [21] and a general algebraic construction valid for any number of sub-channels was provided 
in [22]. 



Definition 1: [22] Let n and L be some positive integers and let g be a prime power. The L matrices 
Aq, . . . , Ai_i over Fg of size n x n are (L, n, g')-UDMs if for every (/cq, . . . , ki-i) such that < ki < 
n V £, J2e=o — (.J2e=o ke) x n matrix composed of the first ko rows of Aq, the first ki rows 

of Ai, . . ., the first /cl-i rows of Ai_i has full rank. 



The authors in [22] provide an algebraic construction of such (L, n, g)-UDMs for any L < q + 1. 
It is shown in [21] that an approximately universal permutation code for the parallel channel with L- 
branches can be obtained from (L, n, g)-UDMs, in the following manner. Assume that we have to transmit 
2n information symbols from ¥q. We encode independently n-symbols each onto the I and Q sub- 
channels. Let u G Fg denote the first n input information symbols. Map the sequence of F^ symbols 
{ Aiu, A2U, . . . , A^u} componentwise onto a L-length vector of ^"-PAM symbols, and transmit the 
components on the I sub-channel. The next n information symbols are similarly encoded and transmitted 
on the Q sub-channel. The rate of such a permutation code is 

2nlogg 
R = bpcu. 

In our case, we set L = MT to obtain codes for the DDF channel. 



Parameters for simulations: The simulations involving permutation codes in Sec. IV-C for M = 4, 
T = 1 are derived from (4, 4, 4)-UDMs, leading to R = A bpcu. In order to completely specify the code, 
we need to provide the mapping to PAM symbols that was used. We construct the Galois field F4 using 
the primitive polynomial + X + 1. Thus any element in F4 may be associated with a polynomial 
biX + h^, where the bi are either or 1, and X is a primitive element. Hence we may also associate 
each element in F4 with the binary string hibQ. In order to map an F| vector v (which is one of the 
AjU considered previously) to the PAM alphabet, first concatenate the binary strings corresponding to 
Vi G F4, z = 1, 2, 3, 4 to obtain an 8-length binary vector b. This binary vector is mapped to the centered 
256-PAM alphabet by computing 2 YJi=o &i2* - 255. 
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C. Decoding decision function (p and Forney 's decision rule 

A first choice for 0, which we shall denote (j)i, would be to allow the relay to decode as soon as the 
mutual information between the source and the relay exceeds MTR, i.e., 

, /,^ • f,. r MR 

= mm <^ M, 



log(l + |/i|V) 

where p' is the SNR of the source-relay Unk. This rule is asymptotically optimal for large T, in fact it 
coincides with the rule in the original formulation of the DDF protocol ([5]). For finite T, (pi is suboptimal 
since it ignores the actual signal received by the relay, i.e., the atypical behavior of the noise may dominate 
the error probability for short block lengths. As an illustration of the inefficacy of this decision function 
at finite block-length, consider the simulation results in Fig. [5] In the simulations to follow, we choose 
to be a rotated QAM code. We will subsequently compare these results with those obtained by choosing 
Xs to be a permutation code, and observe very similar trends. We consider ML decoding at both the 
relay and destination for all the simulations in this sub-section. Further, we assume that the source-relay 
link SNR p' is 3 dB above the SNR p of all other links in all our simulations (the X-axis on all our plots 
is the SNR p in dBs). 

The simulations in Fig. |5] are for the case when Xs is a rotated QAM code, T = 1, M = 4 and R = 4 
bits per channel use (bpcu). The seemingly strange non-monotonic behavior of the error probability 
can be understood by the following intuitive explanation. At low SNRs, the relay hardly ever triggers 
before m = 4, resulting in P(£) being dominated by the error probability at the destination P(£,£r), 
and hence P(£) is large and decreasing. Then, there is an intermediate region of SNR where the relay 
attempts to decode, but it decodes incorrectly with high probability and causes significant interference 
at the destination. Thus P{E) is dominated by the relay error probability -P(£r), and increases in this 
region. For sufficiently large SNR, the relay decodes correctly with high probability and therefore helps 
the destination, thus providing the required cooperative diversity (slope of the overall error curve at high 
SNR). However, this happens at very large gap from the outage probability, that can be regarded as a de- 
facto optimal performance also for finite-length codes and not asymptotically high SNR. This simulation 
reveals a phenomenon that has been scantily treated in previous works: the effect of decoding errors at 
the relay clearly dominates the overall performance. This fact has often been neglected since it is neither 
captured by the T —>■ oo case, where the atypicality of the noise has no effect and triggering the relay 
based on the outage event is exact, nor by the DMT formulation, that does not capture the gap from 
outage, but just the asymptotic error probability curve slope. 

One immediate remedy consists of adopting a conservative relay decoding decision function, which 
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Fig. 5. Xs is a rotated QAM code, T = 1, AI — 4, R — 4 bpcu, relay implements <^i(-)- 



we will denote as (1)2, defined as 



min < M. 



MR 



+ 1 



log(l + |/i|V) 

Simulation results using this strategy are shown in Fig. [6j once again for the case of being a rotated 
QAM code, T = 1, M = 4 and = 4 bpcu. 

In this case, the relay error probability is so low that no errors were recorded in our Monte Carlo 
simulation (no such curve is shown in Fig. [6]l. The downside of this strategy however is that since the 
relay is over-conservative, it helps the transmitter too late, and the overall error probability P{E.,S.r) 
suffers from significant degradation with respect to outage probability. 

In [16], the authors prove that there is no loss in DMT for the DDF protocol using the Alamouti type 
relay by using the following relay decoder function cp^: 

MR 



■ ... 

mm < M , max < — , 



log(l + |/i|V^ 
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Fig. 6. Xs is a rotated QAM code, T = 1, M — 4, R — 4 bpcu, relay implements (j}2{-)- 



i.e., the relay is allowed to decode and transmit only after the codeword from the source is at least 
half-way through. Simulation results of this protocol shown in Fig. [7] reveal that this scheme also suffers 
from a significant penalty at high SNRs due to the P(£r) term dominating the overall error probability. 

In [16], the authors use an extra layer of cyclic -redundancy check (CRC) coding to enable the relay 
to perform error detection (and wait for another round if incorrect decoding is detected) - while this 
strategy is effective in reducing P{E-r), there is an inherent loss of rate. In fact, it is shown in [12] for 
the MIMO-ARQ channel that CRC is suboptimal in terms of DMT since the undetected error probability 
must decrease with SNR at least with the same exponent of error probability itself, and this requires a 
number of CRC bits that grow linearly with log SNR. The same consideration applies here. Hence, we 
wish to avoid the use of CRC in order to detect if the relay decodes in error. 

We present a novel strategy to enable error detection at the relay without further layers of coding at 
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Fig. 7. Xs is a rotated QAM code, T = 1, Al — 4, R — 4 bpcu, relay implements <^3(-). 



the transmitter. We make use of a criterion introduced by Forney in [14] in the context of retransmission 
(ARQ) protocols to decide whether the decoder is in error or accept the decoding outcome. Here, we 
apply this criterion to the relay decoder, that we refer to as Forney's decision rule. Interestingly, Forney's 
decision rule is similar in essence to the bounded distance decoder that we have considered in the proof 
of Theorem [2] However, while the bounded distance decoder is easy to analyze but only asymptotically 
optimal, Forney's decision rule has the remarkable property of striking an optimal balance between the 
probability of undetected error at the relay and the probability of rejecting the decision and waiting for 
the next slot (probability of decision "erasure", in the language of [14]). To the best of our knowledge, 
this decoding decision rule was not proposed before in the context of relay cooperative communication. 
We define the decoding decision function (j)F{h,yr) using Forney's decision rule as follows: 
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1) If (j)i{h) = M, don't decode and set (j)F{h,yr) = M (worthless trying to decode if we are in 
outage). 

2) If 4'i{h) = m < M, decode after the block and apply the following threshold test. Let lo 
denote the outcome of the relay decoder. Accept the decision and trigger the transmission mode if 



>r, 



(40) 



where r a suitable threshold set empirically for each SNR. If the threshold is not exceeded, wait 
for the next block and repeat this step until either the threshold is exceeded or m = M. 
(pF is found to be extremely effective in suppressing the error probability at the relay without being too 
conservative and refraining from helping the destination when possible. Simulation results for the case 
when Xs is a rotated QAM code, T = 1, M = 4 and R = 4 bpcu are shown in Fig. |7] 
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Fig. 8. Xs is a rotated QAM code, T = 1, Al — 4, R — 4 bpcu, relay implements <^f(-)- 
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The error probability is within 1 dB from the corresponding outage probabiUty. 
Fig. |9] shows the results when we choose to be a permutation code, with T = 1 , M = 4 and R = 4 
bpcu. In this figure we considered two cases: the case of a genie aided relay, where a genie provides 




5 10 15 20 25 30 35 

SNR (dB) 

Fig. 9. Xs is a permutation code, T — 1, M = 4, R = 4 bpcu, relay implements (f>F{-)- 

the relay with the source message as soon as the mutual information exceeds MTR, and the case where 
the relay performs minimum distance decoding in conjunction with Forney's rule. The results are similar 
in flavour to the case where is a rotated QAM code, with the permutation code losing 1 dB with 
respect to the rotated QAM code. Notice that despite the DMT optimality, these codes may perform 
differently depending on their shaping and coding gain. In this case, it is apparent that the rotated QAM 
code outperforms the permutation code, although they achieve the same diversity. 
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D. Low complexity MMSE-GDFE Lattice Decoding 

As we saw in ([39]), the choice of rotated QAM codes makes Xs a lattice code. Let A be the 2MT- 



dimensional lattice corresponding to the generator matrix G in p9\ . MMSE-GDFE lattice decoding has 
been shown to be DMT optimal for the class of lattice space-time (LaST) codes over MIMO channels 
[11], and has also been shown to perform well for deterministic structured LaST (S-LaST) codes [26], 
[27]. Let V{A) denote the fundamental Voronoi cell of an n-dimensional lattice A (See [20] for definitions 
relating to lattice theory). The lattice quantization function is defined by 

QA(y) = argmin|y- A| 
AeA 

and the modulo-lattice function is given by 

y mod A = y - QA(y). 

In the sequel, we will work with the real channel model which is equivalent to ([T]l, ([2]) and Q, obtained 
by writing signals explicitly in terms of their real and imaginary parts (see for example [1] for details 
regarding the equivalence between real and complex channel models). By slight abuse of notation, we will 
refer to the real equivalent of the complex vectors and matrices Xs,y,yr, A,G using the same notation. 
In order to use reduced complexity MMSE-GDFE lattice decoding [11], information needs to be encoded 
onto cosets of a sublattice As of A, as follows. Choose A^ = QA, where Q G Thus |A/As| = Q'^^^'^. 
Let C denote the set of points {Gz | z G Z'^^'^}, where Zq = {0,1, . . . ,Q — 1}. The transmitter selects 
a codeword c G C, generates a pseudo-random dither signal u with uniform distribution over V(As), and 
obtains the transmitted codeword 

Xs = [c — u] mod As- 

Thus information is encoded onto the cosets of the partition A/A^: is a coset representative of the 
coset onto which the information is encoded, and belongs to the fundamental Voronoi region of A^. Let 
denote the coset of A^ in A onto which the information corresponding to message to is encoded. From 
([T]), Q and Q the (real equivalent) received signals y,. = y^Q for M = m at the relay and y^ = y*Q 
at the destination may be written as 

y,. = H^Xs + V (41) 

and 

ys = Hxs + w, (42) 
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where H^. G (^2mTx2MT ^ (^2MTx2MT (lejjote the (real) equivalent channels at the relay and 

destination, and v and w denote the (real equivalent) noise at the relay and destination respectively. 



Notice from ( |4T) that decoding at the relay corresponds to solving an under-determined system of linear 
equations. We follow the approach of [34] in this case, where it was shown how MMSE-GDFE lattice 
decoding may be used to efficiently solve under-determined systems of linear equations. We focus on 
decoding at the relay in the sequel, the decoder at the destination is identical upon replacing the relevant 
signals and parameters at the relay with those at the destination. Let F and B denote the forward and 
backward filters of the MMSE-GDFE (see for example [11] for the definition of these matrices in terms 
of Hr and the relay SNR). The relay produces the modified observation 



and computes 



y; ^ Fy, + Bu, 



are min |y' — BGz 



2 



The relay then decides in favor of the coset that contains the point 

[Gz] mod As. 



In order to work with the lattice coding and decoding scheme, Forney's decision rule ( |40| ) needs to be 
modified to take into account the fact that information is encoded onto cosets as against points in the 
lattice. Encoding information into cosets is equivalent to consider a modulo-A^ channel with output y'^ 
modulo BA^. Hence, the relevant likelihood function is given by 

p{y'^\uj,h)= ^ p^(y^ - B(c^ + As)|/i) 

with domain y^ G V(BAs), where is a coset representative of C^^ and where pq,{-w\h) denotes the pdf 
of the noise induced by the modulo-A^ channel with the dithering, that is, w = yj. — Bx^ where is the 
transmitted signal. Unfortunately, p{ij is difficult if not impossible to determine in closed form. However, 
a good practical choice that works well for good shaping lattices is to let p^^j be a Gaussian pdf with 
i.i.d. components ~ !N(0,cj^/2) (see [11] for a theoretical asymptotic justification of Gaussianity in this 
context). Then, the proposed modification of Forney's decision rule ( |40l ) for the lattice MMSE-GDFE 
decoder is given by: accept Q at time m if 

ExeA. pM -B{co + Xs)\h) ^ ^ ^^^^ 

J2cuj^Q Ea^gA, Pwiy'r - B(C<^ + Xs)\h) - 

where, again r is a suitable threshold set empirically for each SNR. The infinite sums at numeration and 
denominator can be safely truncated by restricting to a number of most likely lattice points, which may 
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be done as follows. Generate a list of !N = {BAj : Aj G A}^^ of lattice points of the lattice generated by 
BG that are closest to y^. Such a list may be generated, for example, by using a standard lattice decoder 
with a sufficiently large search radius. For any given message uj, check whether Xi belongs to the coset 



+ As. If yes, then this point makes a contribution towards the numerator of ( [43] ), else towards the 
denominator. If there exists Q for which the corresponding ratio crosses the threshold r then accept the 
decision, otherwise reject and wait for the next slot. 

The modified Forney's rule in (|43]l is seen to be quite effective for the case when MMSE-GDFE lattice 



decoding is performed at both the relay and the destination. The simulations in Fig. 10 compares the 



performance of rotated QAM codes with T = l, M = 4, R = A bpcu that use Forney's rule d40b with 



ML decoding and modified Forney's rule ( |43] ) with MMSE-GDFE lattice decoding. The low complexity 
lattice decoder tracks the ML performance within 1 dB. 
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Fig. 10. Xs is a rotated QAM code, T = 1, M — 4, R = 4 bpcu, relay implements Forney's or modified Forney's rule. 
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V. Conclusion 

We presented a characterization of the achievable DMT of the single-relay DDF protocol for finite 
block length. Our achievability proof yields insight on the design of actual coding schemes. In particular, 
we stressed the importance of a relay decoding function that check the reliability of its decision, in order 
not to jam the destination. Also, we showed that the destination need not be aware of the relay decision 
time, since a GLRT-based decoder achieves optimal DMT performance. This may have some impact on 
the design of practical DDF protocols, since it essentially shows that no complicated side-information 
channel needs to be implemented in order to explicitly notify the destination about when the relay starts 
transmitting. 

In our proofs, we considered a bounded distance decoder at the relay and an ensemble of random 
Gaussian codes. Then, we constructed practical and very simple codes based on lattices (rotated QAM 
constellations) and permutation codes. We demonstrated via simulation that the impact of undetected 
decoding errors at the relay may be huge. In order to tackle this problem, we have proposed the use of 
Forney's decision rejection rule, that proves to be very effective. 

Finally, we have investigated the use of a reduced complexity MMSE-GDFE lattice decoder and 
modulo- A lattice codes, that yields the well-known low-complexity decoding even at the relay. It should 
be remarked that the relay invariably has to decode an undetermined linear system, therefore standard 
sphere decoding algorithms fail. 

A few comments relating to future work are in order here. We have not exploited the low-complexity 
quantization-based decoding approach for permutation codes [21], owing to the fact that it is not com- 
pletely clear as to how we can apply a good decision rejection rule at the relay in this case. Another 
interesting problem relates to code design for finite but large T; in this case, neither the constructions 
presented in this paper, nor those in [28], [29] are fully controllable in terms of coding gain and both 
entail very high decoding complexity. Concatenation of short codes based on rotated QAM constellations 
or permutation codes with some form of outer coding (along the lines of [25]) may prove to be appropriate 
for this scenario. 

Appendix I 

Separated Relay Activity Detection 

In this Appendix we treat a side problem. An intuitive low-complexity scheme for detecting the relay 
decision time consists of treating M as a random parameter, and use ML detection by disregarding the 
structure of the channel codes. Intuitively, the destination should be able to detect a transition in the 
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received power, between the listening phase and the transmission phase of the relay. This approach is 
referred to as separated Relay Activity Detection (RAD), since the decision time and the source codeword 
are separately decoded, in contrast with the GLRT decoder analyzed in the proof of Theorem [2] We shall 
show that separated RAD yields no performance loss when we consider the limit of T ^ cxd. On the 
contrary, it is suboptimal and actually may perform very poorly when limits are taken in the reverse 
order, that is, for each finite T we consider the performance as SNR gets large. 

We assume that the source uses an i.i.d. random Gaussian code and the relay implements the Alamouti- 
DDF scheme. As before, let M denote the decision time. An ML decision time detector that is ignorant 
of the codebooks treats the channel input as a random Gaussian signal. The detection rule is given by 

M = argmaxp(y|M = m,gi,g2). 

m 

where p(y|M = 771,51,(72) shall be denoted in the following simply by p{y\m) for simplicity, and it is 
given by 

MylH = . ...^T ^xpf- , (44) 



1 / -lv*^P 

exp „ „ '^"^,L , , . (45) 



{vr [(IffiP + \g2\')p + " V(l5iP + \92\')P + 1 

Suppose M = m, we define the pairwise error event 

r /. A fp{y\'m') 

{m — > m I = < — — I — r- > 1 



I p{y\ 



mj 



The detector error probability is lower bounded by 



P{M / M) > max P{m m), 
and is upper bounded by the union bound 

P(M / M) < (M - 1) max P{m m). 

Hence, we shall study the diversity exponent of P{m — > m!) for general m ^ m'. If this does not depend 
on m, m' we have determined the diversity exponent of the separated RAD. 

A. Infinite block-length 

If M = m and T —>■ 00, the law of large numbers yields the almost sure convergence of the limits: 

^|yn-iP ^ l<n<m 
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and 

^lYn-il' ^ (bi|' + l52|>+l, m + l<n<M. 

Thus, for large T we have 

p{y\m) ^ exp {-MT - mT log [7r(|ffi|V + 1)] - (M - m)riog [7r((|5ip + \g2\'')p + 1)] } . 

Consider the case m! > m (the other case follows in the same way and it is omitted for brevity). We 
have 

p{y\m') ^ exp|-mr- (M-m')r- (m'-m)T^^^^^J^^-^^^^- 
l 1511/5 + 1 

-m'Tlog [7r(|5i|V+ 1)] - (M - mO^^log [^r (dffip + \g2?)p+l)\ } • 
After some simplifications, the pairwise error probability for T — >^ oo is given by 

P(m ^ m'\gi,g2) = P(l-Xi + logXi > 0) , (46) 

where we let 

^ (|ffl|' + |g2p)/0+l 

Since logx < x — 1 V a; > 0, we see that {m m'} can occur only if = 0, which is an event 
of measure 0. Therefore, we conclude that P{m m!) J, for any fixed p, d& T ^ 00. This shows 
that for the infinite T case, even a very simple separated RAD scheme at the destination yields perfect 
knowledge of the relay decision time without any need of a side information channel that involves some 
protocol overhead. 

B. Finite block-length 

We now fix T to be an arbitrary finite value and study the diversity exponent of P{m m') as 
p ^ 00. Again, we consider only the case m! > m. The likelihood function for the hypothesis m! when 
M = m is given by 

p{y\m') = exp (-rn'T log [vrd^i IV +1)] - (M - m')Tlog7r [d^ip + |92p)p + l] 

|yo"^P + |y;;^? \y^'? \ 
Wp + 1 {\9i? + \g2?)p+i)' 

After some algebra, we find that 

P{m ^ m'|5i, 52) = p[x< ^"^'^^^^ log(l + X2)) , (47) 
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where 



X 



m 



l + {\9i\' + \92\')p 

is a central chi-squared random variable with 2T{m' — m) degrees of freedom and mean T(m' — m), 
and we define 

y \92?P 



As an aside, notice that ^ log(l + x) is a decreasing function of x that is less than 1 for all x > 0, and 
approaches 1 for x | 0. Therefore, the term ^^^^^^^^ log(l + X2) in (47) is always strictly less than 



E[x] = {m' — m)T for all \g2\ > 0. Therefore, as an application of the the large deviation theorem [17], 
we find that P(m — > m') [ exponentially with T for all finite p and \g2\ > 0. Thus, we recover in a 
more rigorous way the result obtained before by letting T — > 00 directly in the detector decision metric. 

Returning to the case of finite T, we have using well-known properties of the chi-squared distribution 
that 

P(X <U) = ^—L^uim'-m)T ^ ^^^^m' -m)T+l^ 

[[m' — m)l j! 

for small u, and obviously 

P(X <u) = 0(1) 

when u = j3{m' — m)T for some constant /3 > 0. Fix an arbitrary < (3 < 1. From what was said 
before, there exists an X2 > such that ^ log(l + X2) = P- Hence, consider the event 

&{p,P) = {X2<X2} 

= {|52|V<^2(l + |5l|V)}- (48) 



It is clear that for all (31,(72) £ ^{p^P), the pairwise error probability P{m m'\gi,g2) in (47 1 is 
exponentially equivalent to a constant as p ^ 00, i.e., 

P{m ^ m'\gi,g2) = (51, 52) G £(/?, /?)• 

Averaging with respect to 51,52, and using the standard variable substitution \gi\'^ = |(72p = /0~"^ 

we find 



p-"i-°^dai(ia2, 



where, from (48 1, 



£' = {(ai, 02) G : 1 - a2 < [1 - ai] + } • 
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Using Varadhan's lemma, we find that tlie diversity exponent of the pairwise error probability is given 
by 

A = inf {«! + 02} = 0, 

(ai,Q:2)e£' 

since the point ai = 0, a2 = belongs to the boundary of the region £'. 

This shows that for any finite T, a separated RAD scheme based on optimal (Maximum Likelihood) 
detection of the relay decision time M that ignores the codebook structure and treats the transmitted 
signals as random processes is very suboptimal. In fact, the probability of error of such a scheme is 
constant with SNR and eventually will dominate the performance of the whole destination decoder. 

In some way, this result shows that the joint detection of the relay decision time and of the information 
message is necessary in order to achieve the optimal (infinite T) DDF DMT. 
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